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Abstract 

Consider a d-ary rooted tree (d > 3) where each edge e is assigned an i.i.d. (bounded) 
m^j \ random variable X{e) of negative mean. Assign to each vertex v the sum S(v) of X(e 



over all edges connecting v to the root, and assume that the maximum S 1 * of S(v) over all 
vertices v at distance n from the root tends to infinity (necessarily, linearly) as n tends 
to infinity. We analyze the Metropolis algorithm on the tree and show that under these 
assumptions there always exists a temperature 1/(3 of the algorithm so that it achieves a 
linear (positive) growth rate in linear time. This confirms a conjecture of Aldous |Ald98j . 
The proof is obtained by establishing an Einstein relation for the Metropolis algorithm 
on the tree. 
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m ■ 1 Introduction 



Given a d-regular rooted tree, attach to each edge e a random variable X(e), such that the 
variables are independent and identically distributed. For a vertex v in the tree, denote by 



V-i . S(v) the sum of the variables X(e) over all edges e on the path from the root to v. This 



defines a branching random walk, a basic model for a disordered tree. It is natural to ask 
for an efficient algorithm which explores the vertices of this tree in order to find vertices v 
with a large value of S{y). In fact, Aldous |Ald98j proposed this problem as a benchmark 
problem for comparing different generic optimization algorithms, since the naive approach, 
which would be to simply explore all vertices down to the level n in the tree and taking the 
one with the maximal value of S(v), is a bad choice for an algorithm because the number of 
vertices grows exponentially in n. 

The Metropolis algorithm is a general recipe for constructing a discrete-time Markov chain 
on a finite state space for which a given distribution ir is stationary and whose transitions 
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respect a given graph structure of the state space. In the context of the comparison of 
algorithms discussed earlier, Aldous |Ald98j suggested to use the Metropolis algorithm to 
"sample" a certain Gibbs measure on the vertices of a branching random walk tree, namely 
the one which assigns mass e@ s ( v ' to a vertex v, for some parameter j3 > 0. In the case 
where this measure is infinite, for example when there is an infinite number of vertices v with 
S(v) > 0, this algorithm should "walk down the tree" and, for an appropriate choice of the 
parameter /3, find vertices v with high values of S(v ). Let \v\ denote the level of the vertex v 
in the tree and let V& be the vertex visited by the Metropolis algorithm at the time k. Aldous 
raised the following natural question: If the maximum of the branching random walk has 
positive speed, i.e. if linin^oo max^| =n S(v)/n > 0, does there exist a choice of the parameter 
P, such that lim inf fc-^oo S(Vk)/k > 0? We will answer this question in the affirmative for a 
certain class of laws of the variables -^(e), including the binomial distribution. 

In fact, we show more: Let vp = lim^oo S(Vk)/k, which exists almost surely [Ald98j . We 
show that there exists a parameter (3q > 0, such that vp = and (du / g/d/3)| j g =j g = er 2 /2, 
where a 2 is the asymptotic variance of S(Vk), which we show to be positive and finite. This 
result was conjectured by Aldous, who gave heuristic arguments and numerical evidence for 
it (in the case where the variables X(e) only take the values 1 and —1). Results of this type 
are also known as Einstein relations in the domain of random walks in random environments 
and our methods of proof will indeed rely on many techniques from this field, some of which 
have been obtained recently. 

1.1 Definition of the model and statement of the main result 

We are given a d-regular infinite rooted tree, d > 3. The root is denoted by p and the 
level/depth of a vertex v in the tree by \v\. The notation u ~ v denotes that u and v are 
connected by an edge. The parent of a vertex v is denoted by v (with the convention p = p). 
We write u < v if u is an ancestor of v and u < v if u < v but m/d. We furthermore use the 
following handy notation: if u < v, then [u, v] denotes the set of vertices on the path from u 
to v, including u and v. The notation (u, v], [u,v) and (u,v) then has obvious meaning. To 
each edge e = (v,v), we then attach a random variable X(e), such that the collection (X(e)) 
is i.i.d. according to the law of a random variable X. Here, orientation of the edges matters, 
and we will set X(v,v) = —X(v,v) for all v. 

In what follows, we will introduce several assumptions, which we assume to hold 
throughout the paper. We begin with the following assumptions on the law of X. 

(XS) The law of X is of compact support, i.e. esssup \X\ < oo. 

(XR) There exists /3 > 0, such that E[e^ )X f{X)} = E[f(-X)\ for all bounded measurable 
functions /. 

(XM) inf^o A(/9) > 0, where A(/3) = logE[e? x ] + log(d - 1). 

An example for a law satisfying (XS), (XR) and (XM) is the distribution of 2Y — n, where 
Y follows a binomial distribution of parameters n and p, with p € (po,l/2), where po — 
(1 - v / l-(d-l) _2/ ")/2. In this case, fa = log ±=£. 

We remark that assumption (XS) seems to not be crucial and the argument extends to 
certain distributions with non compact support, at the cost of more complicated technical 



arguments. To avoid this complication we chose to present the result under this simplifying 
assumption. 

We now define the branching random walk by 

S(v)= J2 *fa«), S(p) = 0. (f.f) 

u£(p,v] 

Note that X(u,v) = S(v) — S(u) for every two vertices u and v with u ~ v, by the above 
convention that X(u, v) = —X(v, u). Since A(/3) is the log-Laplace transform of this branching 
random walk, it is known Big77| that lim n _ Sh00 maxu =n S{v)/n exists and is positive under 



assumption (XM). 

In order to define the Metropolis algorithm, we are given a function h : R + — > K satisfying 
the following conditions (examples are h(x) = min(l,x) and h{x) = x/(l + x)): 

(HI) h takes values in [0, 1], is non-decreasing and satisfies h(0) = and linx r _>. 00 h{x) = 1. 

(H2) It is Lipschitz-continuous and continuously differentiable on (0, 1) U (1, oo). 

(H3) It satisfies the functional equation h(x) = xh(l/x) for all x > 0. 

For a given realization of the branching random walk and a parameter (3 € R, the Metropo- 
lis algorithm is then the Markov chain (V n ) n >o on the vertices of the tree with the transition 
probabilities Pp(v,w) given by 

Pp{v,w) =pp{X(v,w)) for w ~ v, where pp{x) = -h(eS' 3o+l3 > x ) 



Pp(v,v) = 1 - Y^ p fi(v,w). 



We denote the (annealed, i.e. averaged over the environment) law of the Metropolis algorithm 
on the branching random walk tree by P^. Our main theorem is the following: 

Theorem 1.1. Set S n = S(V n ). 

1. The limit a 2 = lim n _ s . 00 S 2 /n exists F^-almost surely and is a strictly positive and finite 
constant. 

2. For each [3 € M, the deterministic limit vp = linin^oo S n /n exists Fp -almost surely and 
satisfies 

lim V -l = G -. (1.2) 

We note that the existence of vp and the fact that it vanishes at /? = were already shown 
in |Ald98j (in a slightly more restrictive setup, but the argument does carry over). The main 
novelty in Theorem 11.11 is the proof of the Einstein relation (|1.2|) , as well as the fact that the 
right side is strictly positive. 



1.2 Related works 

Our main inspiration, as noted above, is Aldous' work [AM98J. In that paper, Aldous makes 
the crucial obervation that a reversible invariant measure for the environment viewed from 
the point of view of the particle exists at f3 = 0, and derives from this that vq = 0, and 
the existence of the limit a 2 under Po; he also completely analyzes a greedy algorithm and 
formulates a series of conjectures, some answered here. In the same paper, Aldous also refers 
speculatively to JLR94] as relevant to the analysis near /3 = 0; indeed, the approach of the 
latter to proofs of the Einstein relation forms the basis of the current paper, as well to recent 
advances in the analysis of the Einstein relation for disordered systems, as we now discuss. 

The Einstein relation (ER) links the asymptotic variance of additive functionals of (re- 
versible) Markov chains in equilibrium to the chains' response to small perturbations. In a 
weak limit (where the time-scale is related to the strength of the perturbation), Lebowitz and 
Rost |LR94j provide a general recipe (based on the Kipnis-Varadhan theory, see |KLQ12] for 
a comprehensive account) for the validity of a weak form of the ER in disordered systems. 
For the tagged particle in the symmetric exclusion process, the ER was proved by Loulakis in 
d > 3 |Lou02j by perturbative methods (using transience in an essential way); this approach 
was adapted to bond diffusion in Z d in special environment distributions |KO05aj . For mixing 
dynamical random environments with spectral gap, a full perturbation expansion was proved 
in |KO05b| . 

Significant recent progress was achieved by [GMP12] . where the Lebowitz-Rost approach 
was combined with good uniform in the environment estimates on certain regeneration times 
in the transient regime, that are used to pass from a weak ER to a full ER. These uniform 
estimates are typically not available for random walks on (random) trees, and a completely 
different approach, based on explicit recursions, was taken in [BH OZllj . where (biased) ran- 
dom walks on Galton-Watson trees were analyzed. While we still consider walks on trees, 
the approach we take is closer to that of [GMP12] , while replacing their uniform regeneration 
estimates with probabilistic estimates, in the spirit of JPZ08] . See also |Guol2] for another 
approach to the proof of the ER in the context of balanced random walks. 

1.3 Overview of the proof and outline of the paper 

As discussed above, the starting point is Aldous' observation that under Po, the environment 
viewed from the point of view of the particle forms a reversible Markov chain. We begin 
by proving this (Proposition I2.2J ). and then apply the Kipnis-Varadhan theory to deduce an 
invariance principle for anti-symmetric additive functionals (Lemma 12. 4|) . This allows us to 
prove the weak ER, Theorem 12 .1\ following the Lebowitz-Rost recipe. 

To handle the perturbation, estimates on regeneration times and distances are crucial. 
We work with level regeneration times, that are introduced in Section [3J these involve the 
random walk location {V n }, not the vertices values {S n }; of course, the latter influence the 
transition probabilities of the random walk. In order to transform the weak ER to a full ER, 
we need uniform moments of the regeneration times. These are obtained in Proposition 14.11 
where it is proved that the regeneration times exhibit uniform annealed stretched-exponential 
bounds. The proof has two main steps: first, exponential moments are proved for regeneration 
distances, using in a crucial way a structure lemma of Grimmett and Kesten, see Lemma [4.31 
Then, the estimates for regeneration times are obtained, using that the walk must visit 
many well-separated fresh vertices, and between two such visits, the walk has a large enough 



probability to hit distant levels. In proving the last statement, an argument of Aidekon 
|Aid08] is used, see Lemma H~51 

Acknowledgement We thank David Aldous for bringing the problem of the analysis of the 
Metropolis algorithm on the tree to our attention. 

2 The weak Einstein relation 

In this section, we show that the Einstein relation holds for times of the order of f3~ 2 . Specif- 
ically, we will prove the following result: 

Theorem 2.1. 

1. E[S n ] = for all n > 0. Furthermore, the limit a 2 = hrrin^oo E[<S^]/n exists and is a 
finite, non-negative constant. 



2. Set S? = /3S L/ 3-2j . Then, 

Ep[SP]-t^r-, as P^O. 



a 2 



The proof of Theorem 12.11 uses a fairly generic and now classical change of measure argu- 
ment in the spirit of Lebowitz and Rost [LR94 . It uses the crucial concept of the environment 
seen from the particle, which we define as follows. 

Let Q be the space of rooted d-regular unlabelled trees ui with marked edges, i.e. to 
every two vertices u and v with u~t)we associate a real number A"^(u, v) with A w (i;,u) = 
— A^(u, v). Note that "unlabelled" means that we do not distinguish between the neighbors 
of a vertex, this will be crucial for what follows. The root of every tree w is denoted by p. 
For every vertex v , we then define the shift operator 9 V : £1 —■ Q, which yields the tree u 
"seen from the vertex v" . A bit more formalljo, if for a vertex u in lo we denote by 0~ 1 u its 
corresponding vertex in 6 v oj, then Xg vUJ (9~ 1 u, 0~ l w) = A^(u, w). Note that in particular, the 
mark of the edge (p, v) changes its sign. 

Define S^{v) for each vertex v analogously to (jl.ip . For w £ (1, we then define the 
operators C^ and L w acting on functions / : R — > R by 

C u f = Y,f(SJv)) = Y,f(X^ P ,v)) and L w / = C,(pf), 

where p{x) = po(x) = d~ 1 h(e^° x ) (see Section [TTT]) . 

Let P be the law on £1 under which all edges X u (u,u) are i.i.d. according to the law of 
X (see Section [TTT]) . We denote by E the expectation with respect to P. Furthermore, let 
PjS be the law of the Metropolis algorithm (V n ) n >o with transition probabilities Pa defined in 
Section 11.11 and the underlying tree ujq distributed according to P. Expectation w.r.t. P^ is 
denoted by E^, and we also set P = Po and E = Eo- Setting u: n = Oy n ojQ then defines a Markov 
chain (w n ) n >o on the space fi, which jumps from u to v uj with probability pp(X{p, v)) for 
every v ~ p. Let (T n ) n >o be the natural filtration of (w n ) n >o, augmented by sets of zero 



1 There are several ways how to render this completely formal, one of which consists of first defining the 
space of labelled trees with marks, defining the shift operator on this space and then taking the quotient space 
with respect to the group of graph automorphisms fixing the root. 



measure. The process S n = 5^0 (XO is then adapted to (J r n )n>o 5 since it can be almost surely 
reconstructed from ujq, . . . , uj n . 

The following result was already observed by Aldous [Ald98j , who had a more complicated 
proof for it. 

Proposition 2.2. The process {oJ n )n>o is reversible and ergodic under P. 

Proof. In order to show reversibility, since (w n ) n >o is a Markov process, we only have to show 
that E[i 71 (wo,<^i)] = ~^[F(loi, loq)] for every bounded measurable functional F : Q 2 — >■ R. For 
this, it is obviously enough to show that K[(F(ujo,uji) — F{u\, Wo))lu^wJ = 0. Now, we have 

E[F(w ,^i)W^] = J2 E fr( X "o(P>v))F(oj ,e v u Q )} 

= Y,nP(-XUP,v))e> SoX ^F(u,d v L J )], 



u~p 



where the last equality follows from assumption (H3). Conditioned on X u (p,v), the envi- 
ronment 9 v oj is distributed as w but with one edge pointing away from the root bearing the 
value — X w (p, v) (remember that the vertices are unlabelled, such that "it can be either one 
of them" ) . By assumption (XR) , the right-hand side of the last equation is therefore equal to 



VEb(X^,v))%, W )] = E[F(wi rt )l 






^OT^i J > 



which finishes the proof of the reversibility. 

Ergodicity follows from a classical ellipticity argument which we recall (see also [Zei04, 
Corollary 2.1.25] for a similar argument): Let Q be a stationary probability measure of the 
Markov chain (cj n ) n >o with Q<P. We wish to show that P <C Q, which will imply ergodicity 
since ergodic measures are the extremal points in the convex set of stationary probabilities. 
Define the event E = {dQ/dP = 0}. By invariance, E Q [P1 E ] = E Q [1 E ] = 0, where P 
is the transition kernel of the Markov chain. This further implies 1 E > P^-E, P-almost 
surely. Since the transition probabilities are strictly positive, we furthermore have 1 e (uj) > 
/( w )E»~p1e(^ w ) f° r some strictly positive function /, whence 1 e (uj) > max v ^ p t E (9 v u}). 
But since P is a product measure and therefore ergodic, Birkhoff's ergodic theorem now gives 
t E > P(E), which implies P(E) € {0, 1}. But Q < P by hypothesis, whence P(E) = 1. This 
finishes the proof. □ 

We recall the following basic fact about reversible processes. 

Lemma 2.3. For any bounded measurable junctionals F and G and every n > 0, we have 

E[F(wo, • • • , u n )G(u n )] = E[FK, . . . , wo)G(wo)]. 

We will need the following result about anti-symmetric additive functionals of reversible 
ergodic Markov processes. It is implicit in the proof of Theorem 2.1 in [DFGW89] and relies 
on a celebrated result from [KV86] (see also Chapters 1 and 2 in [KLQ12| for a comprehensive 
account of the theory) . 

Lemma 2.4. Let F : O 2 — > K be an anti- symmetric measurable functional, i.e. F(oj,oj') = 
F(uj',u)) for all uj,uj' € O, with ¥,[F(u)q, wi) 2 ] < oo. Define a sequence of random variables by 
Y n = Ylk=i F(oJk-i,Uk) for all n > 0. Then 



1. The time-variance a 2 = lim„_ s>00 -E[Y^] exists and is finite. 

2. There exists a square integrable martingale M n and a process A n , such that Y n = M n + 
A n and —j=A n converges to in L 2 . 
Gaussian variable with variance a 2 . 



A n and —i=A n converges to in L 2 . In particular, —j=Y n converges in law to a centered 



We are now ready for the 

Proof of Theorem \2.1\ First part: S\ is (almost surely equal to) an anti-symmetric func- 
tional in the sense of Lemma 12.41 In particular, IE [Si] = — E[Si] = by Lemma 12.31 whence 
E[S n ] = for all n > 0. The second statement follows from the first part of Lemma 12.41 

Second part: As mentioned in the beginning of the section, we will use a change of 
measure argument as in [LR94J . The basic idea is to write the Radon-Nikodym derivative 
of Pp with respect to P as an exponential martingale of the form exp(Z^ — ^An) for some 
martingale (Zn) n >o and to show that the pair (/3S , ^-2j , Zf g _ 2 , ) converges in law under P to a 

centered Gaussian vector (Gs,Gz) with covariance ~E[GsGz] = ^E[G|] = \a 2 . The theorem 
then follows from a standard change of measure argument for Gaussian variables. Here are 
the details: 

Step 0: Set AS n = S n+ i — S n for all n > 0. The Radon-Nikodym derivative of P^ with 
respect to P is given by 



fc=0 



p(AS k ) g V 1 - C^p 



log ^F^v + log ( - r: ) ^ k =u, h+1 



Yn- (2.1) 



Note that if 1 — C Uk p = 0, then uj k ^ 0Jk+i with probability 1, such that the second summand 
is well defined. We now set Z n = Y n — ^fe=i^[^fc — ^fc-il-^fc-i] and A n = — 2^^ =1 E[lfc — 
^fc-il-^fe-i]) such that Y n = Z n — ^A n and (Z n ) n >Q is a martingale with respect to the filtration 

{J^n)n>0- 

Step 1: We wish to show that the random variable Z" = Z\a-i\ converges in law under 
P to a centered Gaussian variable with variance o 2 z < oo. This will follow from the martin- 
gale central limit theorem for stationary ergodic sequences (see e.g. |Dur961 Theorem 7.5]) as 
soon as we show that Z n can be decomposed into a sum of two martingales M n + N n , with 
M n+ \ — M n being a stationary ergodic sequence of square-integrable martingale differences 
and f3N\p-2\ vanishing in probability, as (3 — > 0. Now, by assumptions (Hl-3), there exist 
measurable functions qp(x) and q(x), such that pp(x) = p(x)exp((3qp(x)), with qp(x) uni- 
formly bounded for all x € R and small enough /?, and qp(x) — > q(x) as (3 —> 0, for all x € P. 
Note that q(0) = qp(0) = by assumptions (Hl-3). We can then write the process Y n as 

Y n = 02>(A5*) + J>g ( 1 - -*&- 1 t Uk=0Jk+1 . (2.2) 

fc=o fc=o v l ^P J 

Note that by assumptions (Hl-3), there exists a constant c > 0, such that for small enough 
j3, we have \L u) (e° q ' 3 — 1)| < /3c(l — C u p) and \L w q\ < c(l — C u p). Defining the martingale 

n— 1 T 

M n = ^q(AS k ) - T -^-l a;fc=aJfc+i , 



we then have E[(Z n — (3M n ) 2 ] = o(/3 2 n), such that the above-mentioned martingale central 
limit theorem yields convergence of Z@ in law under P to a centered Gaussian variable with 
variance a\ = E[M X 2 ] = E[L w g 2 + (L w g) 2 /(1 - C u p)). 

Step 2: We wish to show that the random variable A 13 = A 13-21 converges in probability 
to a\. Expanding the second summand in (|2.2p up to terms of second order in /3, one sees 
that E[Ai] = /3 2 c?7 + o(/3 2 ). The convergence is then a consequence of the ergodicity of the 
process (w n ) n > . 

Step 3: Recall the definition S 13 = /35 , ^ / g-2j. We wish to show that the pair {S",Z") 
converges to a centered Gaussian vector (Gs, Gz) with covariance E[GsGz] = ^E[G|] = \o 2 . 
By Lemma 12.44 every linear combination aS n + bZ n is the sum of a martingale and a process 
A' n with -i^A' n — > in L 2 , as n — > oo. Convergence to a Gaussian vector with E[G|] = a 2 
then follows from the martingale central limit theorem ( |Dur96l Theorem 7.4]) together with 
the convergence of Z@ established above. 

It remains to show that linig_>o K[S^Z^] = cr 2 /2. In order to prove this, recall the definition 
of AS n = S n+ i — S n and define AZ n = Z n+ \ — Z n . We have for every n > 0, 

E[S 2 +1 - S 2 ] = E[(AS n ) 2 + 2S n AS n ] = E^Jx 2 ) + 2S n L Wn x], (2.3) 

where x is the identity function. By Lemma [273], we have E[S' n L a;n x] = K[(—S n )L wo x], whence, 
summing (|2.3p over n, we get, 



n k~2 

E[S 2 n ] = ^E[L W0 (x 2 ) - 2L W() x x £ AS,}. 

fc=l j=0 

Furthermore, since (Z n ) n >o is a martingale, we have 

E[S n+1 Z n+1 - S n Z n ] = E[Z n+l AS n ] = E [/3AS n (qp(AS n ) - h Wn q p ) + (L Un x)Z n ] , (2.4) 

where we made use of the fact that AS^ = on the event that uj^ = ujk+i- Applying 
Proposition 12.21 to the term E[(L Wn x)Z n ], we see that the terms corresponding to the second 
summand in the brackets of (|2.ip cancel. Summing (|2.4p over n, this yields 

n fc-2 

E[S n Z n ] = pJ2 E i L "°( x W) - W x Yl (^( As ^) - Qpi-ASj))] +E[5„_iL Wo ^]. 
fc=i i=o 

Now, by assumption (H3) we have qp(x) — qp{—x) = x for every x (This is the critical 
point!). Moreover, by reversibility, we have E[L aJ /] = E[L W /] for every function /, where 
J(x) = f(-x). This yields, 

E[L u (xq0)] = ~ X E[L w (x 9/3 - xqp)] = \ X E[L w (x 2 )]. 

Altogether, the previous equations now yield E[S^Z^} = E[(5 /3 ) 2 ]/2 + /3E[5 L/3 - 2 _ 1J L Wo ^]. 
Convergence of the first summand has been established above, and the second tends to by 
Lemma 12.41 and the Cauchy-Schwarz inequality. Hence, we obtain lim^_ Sh oE[S'^Z^] = a 2 /2 as 
claimed. 

Step 5: We claim that Ep[S p ] -> E[G s exp(G z - \o\% as /3 -»• 0. Since (G S ,G Z ) is a 
centered Gaussian vector with E[GsGz] = ^E[G|] = a 2 /2 and E[G|] = cr|, this will finish 
the proof of the theorem. By (|2.ip . E^fS""] = E[5'"exp(Z^ — ^A ^)] and by the convergences 



in law established above, it suffices to show that this last expression is uniformly integrable. 
Now, since Z n and exp(Z n — \A n ) are martingales, A n is a submartingale. A n being T n -v 
measurable, it is therefore increasing in n. It then remains to show that S'"exp(Z^) is 
uniformly integrable. Z n being a martingale with bounded increments, Azuma's inequality 
|Azu67j implies that all exponential moments of Z" are uniformly bounded in /?. Furthermore, 
E[(S'^) 2 ] is uniformly bounded by the first part of this theorem. Holder's inequality then 
yields uniform boundedness of E[(S^ exp(Z^)) c ] for some constant c > 1, which finishes the 
proof. □ 

3 Estimates on the branching random walk 

In this section, we establish an estimate for the branching random walk (Lemma 13.31 below) . 
We recall that for two vertices u, v with u < v, we denote by [u, v] the set vertices on the 
path connecting u and v. Similarly, if n, m G N, then we define [n, m] to be the set of vertices 
between levels/depths n and m. More generally, for a vertex u, we let [n,m] u denote the set 
of vertices between levels/depths n and m in the subtree rooted at u (which means that these 
vertices are between levels n + \u\ and m + \u\ in the original tree), such that [m, n] = [m, n] p . 
Finally, we write [n] for [n,n] and [n] u for [n,n] u . 

Lemma 3.1. There exist c G (0, oo) and b > 1, such that for large L, 

P(Vu G [L] : max \S(w)\ > clogL) < e" L \ 
w£[p,v] 

We will first establish the following intermediate bound: 

Lemma 3.2. There exist constants C\, C*2 > such that for all large L, 

V{3v G [L] : max \S(w)\ < C x ) > C 2 . 

w£[p,v] 

Proof. The proof is a standard first and second moment calculation. Fix C\ > large. Let 
(S n )n>o be a random walk starting at with steps distributed according to the law of X. 
For n > and x G [— C\,Ci], define the event Bn = {V/c < n : \Sk + x\ < C\} and set 
B n = Bn . By assumption (XM) and standard large and small deviations estimates, there 
exists Co < d — 1, such that for C\ large enough, 

3L G N VL > L Vn > : P(B n ) > c^ n . (3.1) 

Indeed, this is obtained e.g. by combining the change of measure in the Mogulskii-Varadhan 
theorem |DZ981 Theorem 5.1.2] with the fact that a centered random walk with bounded i.i.d. 
increments stays in a tube of width a for time n with probability at least e~ Cn ' a for all n 
and a > ao and some constant C > 0; for finer estimates see e.g. |Mog74 . 



In the sequel, we fix such a C\ once and for all. By an argument similar to the above, 
there exists a constant C[ depending on C\ only such that 

(C[)- 1 sup P(i#>) < P(B n ) < C[ inf P(flW) . (3.2) 

xel-CuCi] xe[-Ci/2,Ci/2] 

To see (|3.2p . note from the above that P(B n _c)/P(B n ) is bounded by a constant depending 
on C only, uniformly in n > uq(C), and then couple the walk started at x with the walk 
started at by time C, with a fixed positive probability. 



The second inequality in (|3.2p yields the existence of a constant C'{ (depending on C\ ) so 
that for every k < L, 

P{B L „ k )P(B k ) < C'{P{B L ), (3.3) 

because conditioned on B k , the probability that \S k \ < C\j1 is bounded from below by a 
strictly positive constant uniformly in k. 
For v £ [L], let 

A v = l{max u , e[PiV] |S(io)|<Ci} ■ 
Further let A = Y^ve[L] A v - Then, 

EL4] = (d- l) L P(Vn < L : \S n \ < C x ) = (d - 1) L P(B L ), 

As for the second moment, denote by u A v the most recent common ancestor of u and v. We 
then have for large L, 

EL4 2 ] = J2 HA V A U ] < E p (^) sup P(4-|«A«|) 

«,t;e[L] u,v6[i] xe[-Ci,Ci] 

«,r;e[L] 

where the last inequality follows from (|3.2[> and (J3.3)) . Equation (|3.ip now yields, 

E p(jw _1 < E 4r AU| <^-i) 2L : 

u,M5[L] u,uS[Z/] 

for some C > 0. The lemma now follows from the previous three inequalities together with 
the Paley-Zygmund bound P{A > 0) > EL4] 2 /EL4 2 ]. □ 

Proof of Lemma \3.1l Let c > and set H = [clogL]. Let C\ be as in Lemma 13,21 Let 
g = esssup |X|, which is finite by assumption (XS). The branching random walks spawned by 
the vertices at level H being independent, we have 



P(W € [L] : max \S(w)\ > gH + C x ) < P(Vt> € [L - H] : max \S(w)\ > Ci) ( 

wd[p,v\ wd[p,v\ 



-1)« 



The lemma now follows from the last inequality together with Lemma [3. 21 by choosing c large 
enough. □ 

Lemma 3.3. There exist c € (0, oo) and b > 1, such that for large L, 

P(3u € [0,L] \/v € [L] with u < v : max \S{w) - S{u)\ > clogL) < e~ L \ 

w£[u,v] 

Proof. Let c be as in the statement of Lemma 13. 11 We say that a vertex u is -ff-bad if for all 
v £ [H]u there exists w € [u,v], such that \S(w) — S(u)\ > clogL. Note that if u is H-b&d, 
then it is K-b&d for every K > H. A simple union bound gives: 

P(3u e [0, L] : u is (L - |u|)-bad) < P(3u € [0, L] : u is L-bad) < (d - l) L+1 P(p is L-bad). 

The statement then follows from Lemma 13. 11 D 
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4 Regeneration times 

In this section, we establish a regeneration structure for the Metropolis algorithm, which will 
permit us to prove Theorem 11.11 from the previously established Theorem 12.11 Recall the 
definition of the Metropolis algorithm (V n ) from Section [1.1| which depends on a parameter 
(5 € K. Define the level regeneration times (r n ) n >o by tq = and r n +i to be the first time 
after r n where the chain (V n ) n >o hits a level L for the first time, then immediately jumps to 
level L + 1 and never gets back to level L again. 

As in Section [21 we denote the law of the branching random walk by P, which is a law 
on 0,. We further denote the (quenched) law of the Metropolis algorithm (V n ) n >o started 
from the vertex v and given the branching random walk uj (the environment) by P% a- The 
annealed law is denoted by F v p(du,dV) = P(doj)P^(dV). We also set P^ = P£. Our goal is 
to show: 

Proposition 4.1. There exists a > and n a > such that for all n > n a and \/3\ < Pq/2, 
P/?(ti > n) < e~ na and P /3 (r 2 - n > n) < e" 71 ". 

Note that the existence of tail bounds for the regeneration times which are uniformly 
bounded near the critical point is in sharp contrast to other settings discussed in the literature, 
where the regeneration times usually blow up when the parameter approaches the critical value 
|BHOZlT| IGHSlT] . We remark that we actually only need that Erf is uniformly bounded for 
/3 in a neighborhood of 0, for some k > 2. We also note that the constant 1/2 in front of f5$ 
in the statement of Proposition 14.11 is arbitrary and could be replaced by any other constant 
less than one. 

In order to prove Proposition 14. 1[ we will make use of the relation between the Markov 
chain (V n ) n >o and electrical networks |LP92j : Let N(v) be the set of neighbors of v including 
v. For w 6 N(v), set 

Q(v,w) = ^p^- Q(v)= V Q(v,w) 

w£N(v) 

C(v,w)=Q(v,w)Y[Q(u,u) C{v)= Y, C(v,w). 

u<v w£N(v) 

One checks that for every w € N(v), C(v,w) = C(w,v) and that 

C(v,w)/C(v) = Q(v,w)/Q(v) = Pp{v,w), 

whence the Markov chain (V n ) n >o has an interpretation as the random walk on the rooted 
d-regular tree with loops, induced by the edge conductances C(v,w). By assumption (H3), 
one has for u < v, 

C(U,U) / l ( e (/3o+/3)^W) e ' l ■ > 

By assumptions (XS) and (Hl-3), this implies the existence of a constant c > 0, such that 

ce {Po+l3){S(tf)-S(u)) < C(v,v) < c -i e (,3 +i3)(s(v)-S(u))_ , A2 ^ 

C(u,u) 

In particular, since there exists by assumption (XM) almost surely an infinite ray vq, ■ ■ ■ , v n , . . . 
with liminf n _ ! . 00 S{v n )/n > 0, this ray has finite resistance, such that the chain is transient 
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for almost every oj and every (3 > — /3q, a fact which was already observed by Aldous |Ald98l 
Lemma 8]. 

Define Tl to be the first strictly positive time the chain (V n ) n >o hits the level L. Further- 
more, denote by T u and T* to be the first non-negative, respectively strictly positive time the 
chain hits a vertex u. The following important lemma controls hitting probabilities and will 
be used in the evaluation of escape probabilities. 

Lemma 4.2. There exist c,Lq>0, b > 1, such that for \/3\ < (3q/2 and L > Lq, 
P {3v G [1,L - 1] : P^(T L < T«) < L~ c ) < e~ L \ 

Proof. Let v G [1,L — 1] and let u G [L — \v\] v , such that \u\ = L. By (|4.2p . we have for a 
fixed environment w, for some c > 0, 

-i 



pZATu<t,)=[ £ -^4 >c £ e <*+W*C«)-*M) 



C(w,w) . 

,v<w<u I \v<w<u 



This gives for |/3| < /3 /2, 



P*JT L < Tg) > max P% JT V < T € ) > y max min JPo\S{w)-S(v)\ m 
ue[L-\v\] v L u&[L-\v\] v w£[v,u) 



The statement now follows from Lemma 13.31 □ 

For a vertex v, denote by £(v ) the depth of the first excursion below v after T v , i.e. 

l(v) = sup{|F n | - \v\ :n>T v and \V k \ > \v\ V/c G {T v + 1, . . . ,n}} GNU{oo}. (4.3) 

Note that since the probability of jumping from v to one of its children does not involve 
X(v,v), the event £(v) > is independent from X(v,v) (conditioned on T v < oo). 

Lemma 4.3. There exist a,Lo > 0, such that for |/3| < /3q/2 and L > Lq, 

P(P^A L < KP) < oo) > e~ aL ) < e~ aL . 

Proof. Fix e G (0, 1). For a vertex v, define the variable A(v) by A(v) = 1 if for one of i>'s 
sisters v, one has P^ JT^ = oo,T^ = oo) > e and A(y) = otherwise. In words, A{v) = 1 
if the (quenched) probability of a walk, started at an appropriate sister v of v, to escape 
to infinity through the subtree rooted at v without visiting again v is at least e. By (|4.2p . 
we can choose e such that EL4(i;)] > 1/2 for all |/3| < @q/2. By a result due to Grimmett 
and Kesten (see [DGPZ02, Lemma 2.2] for this version), there exist then a,j > 0, such that 
¥p{Gi) > 1 — e~ for large L, where 

G L = < min y^ A(w) > jL 
ve[L] ^ , 
^ J tue[2,-u] 

Now, let uj G G/,. We wish to bound P w r{L < £(p) < oo). For this, define T m for m = 
2, . . . , L — 1 to be the first time after Tl that the Markov chain (V n ) n >o hits level m. If 
T m < oo and ^4(Vr m ) = 1, then by assumptions (XS) and (Hl-3), the probability that 
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from Vt™ the chain reaches Vt™ after two steps is bounded from below by 5/e for some 5 
sufficiently small. It follows that for uj € Gl, 

PuA L < i{p) < oo) < (i - <j) E «-6[2.v T ] A ^ < (i - sy L . 

This yields the lemma (reducing the value of a if necessary) . □ 

Lemma 4.4. There exist a,Lo > 0, such that for \/3\ < /So/2 and L > Lq, 

Vp(\V T1 \>L)<e~ aL . 
Proof. Define a sequence of random numbers Lq, L\, . . .recursively as follows: 

• L Q = 1; 

• for n G N, let w n = Vr Ln • If £{v n ) < oo, then L n+ i = L n + £(u n ) + 1; 

• otherwise, set L m = oo for m > n. 

Let iV be the largest number n, such that L n < oo. Then by construction, \V T1 \ = L^. 
Furthermore, the differences (L n+ i — L n )o< n< N are independent and identically distributed 
as £ + 1 conditioned on £ < oo and N is geometrically distributed with success probability 
P(£ = oo) > (£ as in (14. 3D). The lemma then follows from Lemma 14.31 □ 



Lemma 4.5. Let Q n be the a-field generated by Vq, . . . ,V n and let T be a stopping time with 
respect to the filtration (Gn)n>0> such that Vp ^ Vk for all k < T. Then there exists a constant 
c > 0, such that for \/3\ < (3q/2 and all N > 0, we have 

Gt) > c 



max \Vj\ > cN 
\T<j<T+N J 

Proof. We follow the proof of [Aid08t Theorem 1.5]. Throughout the proof, cq,c\,... will 
denote positive constants which are uniform in |/3| < /?o/2. 

Step 1. For a vertex v ^ p, define ir^^iv) = P^ a{T$ = oo). Note that the random 
variables ^^^(v), u/p, are identically distributed under P (but not independent). Let 7r Wj/ g 
denote a random variable with this law. We wish to show that for some constant cq, 

E[l/7r W)/J ] < c . (4.4) 

Denote by vi, . . . ,i>d— l the children of the vertex v. By assumptions (XS) and (Hl-3), we 
have 

PZ,f}(yi=Vi)>c u i = l,...,d-l, (4.5) 

which yields n Uj p(v) > c\ Yli=\ n <v,p(vi)- Now, note that the variables ir u ,p( v i)i i = 1, ■ ■ ■ ,i — l 
are independent under P. The previous inequality then yields for every x > 0, 

P(7r w ,^(u) < a:) < P( max n u ,fi(vi) < xjc x ) = P(tt Wj/ }(t;) < x/df' 1 . (4.6) 

i=l,...,d— 1 

Furthermore, by ()4.2p and the remark following it, there exists a constant c-i, such that 

P(*W»(w) < 2c 2 ) < 1/2. (4.7) 
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Together with (|4.6p . this now easily implies (|4.4p . 

S'iep #. For a vertex v, let N v denote the number of times the vertex v has been visited 
by the Metropolis algorithm (y n ) n >Q. We wish to show that for each k > 0: 



E, 



H=fc 



< 



C3- 



(4.f 



Recall that T^ denotes the first strictly positive hitting time of the vertex v, such that 
KA N ^ = 1 / P ZA T v = oo). By g3|), we have P^(T* = oo) > d^fa), such that 

E P [N V ] = E [^(r« < oo)EZ t0 [N v ]] < Cl Fp{T v < oo)E[l/7r w ^(t;i)] < c 4 ¥ (T v < oo), (4.9) 
by (j4.4p . Furthermore, we have for every k > 0, 

1 > ^ P^ (T„ < oo, V n > Vl \Jn > T v ) > Cl Y, Wp(T v < oo)E[7r Wi/ »(wi)], (4.10) 

|u|=fc |^|=fc 



and by f|4.7|) . we have E[7r Wjy g(vi)] = E^^] > c 2 . Equations (|4.10|) and (|4.9p now yield 

S'tep 5. Recall the notation in the statement of the lemma and let L € N. Define the event 
Et = {V n > Vr Vn > T}. A straightforward extension of the proof of the last step permits to 
prove that for every constant C > 0, 



E T , Y, N w >CL\g T ) < -^-^ 

w£[V T ,L] 



we[v T ,L] 



Qt 



< c 3 C-\ 



Furthermore, by f|4.5j) . we have F p (Et | Qt) > ciE[7r Wjy g] > C1C2. This now yields for every 
constant c > and every N € N, 



St > 



max \V<i\ > c/V 

\ T<j<T+N J 



Setting c = C1C2/2C3 yields the proof. 



■we[V T ,cN] 



N W <N 



Qt > cic 2 - c 3 c. 



D 



Proof of Proposition \4-l\ The proof follows an argument in the spirit of [PZ081 Proposition 3] . 
Let L = L(n) go to infinity with n (we will later choose L = n for some constant b). We 
have, with Tl denoting the hitting time of level L, and with a > as in the statement of 
Lemma [ 



j /j(ti > n) < P/3(T|y Ti | > n, \V n \ < L) + P p (\V n | > L) < Fp(T L > n) + e 



-ctL 



(4.11) 



where in the last inequality we used the fact that Ty < Tf, for L' < L. 

Let c be the constant from Lemma 14.21 and set c = 6(c V 1). Throughout the proof, all 
constants will be uniform in the choice of /3 in a small neighborhood of 0. We write L = L c . 
Define a vertex v to be fresh if it is visited by the random walk for the first time before time 
L. We will upper bound ¥^(Tl > n) by showing that on the one hand, there cannot be too 
few fresh points that are well separated and on the other hand, if there are many such fresh 
points, it is unlikely that Tl is large. 
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For a vertex v, let N v denote the number of visits to v by time L and let iV" denote the 
number of times the random walk visits v before time L and then, at the next step that it 
moves, it visits the ancestor of v before time L. Clearly, for each v, N™ < N v , while, using 
the Markov property, there exist constants 7,7' > such that 



(3v : N v > L l/2 ,N% < 7 L 1/2 ) < Le^'^ . (4.12) 



On the other hand, the event 

{N V >L 1/2 ,NZ> 7 L 1/2 ,T L >L} 

-, — 1/2 

implies that the random walk visited v and then hit v at least 7L times. By Lemma 

and the Markov property, the probability that there exists a fresh vertex v satisfying the last 

event is bounded above by 

1(1 - L-y zl/a < Le-^ E/2 - c < Ie-^ 2(cV1) . 
Combining this with (|4.12p . we conclude that for all large L, 

F[s(3v : N v > L 1/2 ,T L >L)< e~ L . (4.13) 

— 1/2 — 1/2 

On the event {Vv : N v < L }, there are at least L fresh points, and therefore there 

are at least L fresh points that are L -separated. Let c be the constant from Lemma [4.51 
At each arrival to such a fresh point, with (annealed) probability at least c the walk hits level 

L before time L/c < L , for large L. It follows that 

P^W : N v < L 1/2 ,T L >L)< Cf /4 , (4.14) 

for some Cq € (0, 1). By (J4.13P and (|4.14p . there exists now a constant b > 0, such that with 
L = n , we have Fr(Tl > n) < 2e . Together with (|4.1ip . this completes the proof of the 
first statement of Proposition 14.11 

As for the second statement, let pi, . . . , pd-i be the children of the root. The law of the 
subtree of the branching random walk tree rooted at V Tl is equal in law under P^ to the 
subtree rooted at V\, conditioned on V\ € {pi, . . . , Pd-2} and on T* = 00. In particular, 

P/3(r 2 -n>n)= Fp(Ti > n\ V x G { Pl , . . . ,/> d _ 2 }, T* p = 00) < P /3 (r 1 > n)/ Cl c 2 , 

where c\ and c 2 are the constants from the proof of Lemma 14.51 This finishes the proof. □ 

Proof of Theorem ! 1 . 1[ Existence and finiteness of the limit a 2 = lim n _ >00 S 2 /n follows from 
the first part of Theorem 12.11 In order to prove the remaining statements, we will use the 
regeneration structure established in this section. Note that the random vectors {(ri+i — 
T i,S Ti+1 — S n )}i>i are i.i.d. under the law Fp and independent from (n,<5 n ). Furthermore, 
by (XS), \S n — S m \ < g\n — m\, where g = esssup |X|. Proposition 14.11 and the P-almost sure 
convergence of S^/n to a 2 now yields 

2 ls „ S 2 k k __E[(S T2 -S T1 ^ 



sl 


k 


lim — -=- 




k— >oo K 


Tk 



^ l im _U, . _ = ^_i ii- > 0, 

E[r 2 - rij 
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which proves the first part of the theorem. 

As for the second part, by standard arguments (see e.g. the proof of Theorem 4.1 in 
|LPP96j ). V/3 = lining oo S n /n exists almost surely if K^[t2 — T\\ < oo, which is the case for 
small f$ by Proposition 14 . 1 1 (but the proof of Lemma [4. 51 actually yields this in the case of every 
(3, such that (V n ) n >o is transient, and in the recurrent case we have vr = 0). Furthermore, 
we have vp = ~Ep[S T2 — 5 n ]/E j g[r2 — t±], which implies in particular that 1^1 < g. 

Now let Kp = inf{/c > : r^ > /3~ 2 }. By the optional stopping theorem, we have 

^&[ S T Kp \ = ^I-SVJ + vpEp[TKf) - n], 

such that by Proposition 14.11 and assumption (XS), for some constant C > 0, 

\Ep[Sp-2] - vp(3~ 2 \ < CEj3[T Kfj - P~ 2 }. 

Crude moment bounds using Proposition 14.11 yield that the right-hand side of the above 
equation is o(/3~ 1 ), which yields lirng_>o vp//3 = lim j g_ s .oE / 3[S' / g-2] = a 2 /2 by Theorem 12.11 
This finishes the proof. □ 
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